OLAP Query Optimization: A Framework for Combining Rule-Based and Cost-Based Approaches
Abstract
To optimize queries in relational databases, two categories of optimization techniques have been proposed: the Rule-Based Approach (RBA) and the Cost-Based Approach (CBA). In the RBA, the optimizer rewrites the query by applying transformation rules over the relational algebra. In the CBA, the optimizer uses a cost model to estimate the cost of each operation from statistics about the database and the tables involved in the query. Commercial DBMSs usually implement both categories and often intermix them. In multidimensional databases, however, most query optimization techniques follow only the CBA to select optimization structures such as materialized views, advanced indexing schemes, and data partitioning. No approach has yet been proposed to rewrite OLAP queries using a multidimensional algebra. In this paper, we show that the RBA can be applied to multidimensional databases by rewriting each OLAP query into a more efficient query that can then be executed using a CBA. In particular, we show that the RBA can take into account one of the specificities of OLAP, namely the visualization of the OLAP query result. We propose a multidimensional algebra that forms the core of our RBA optimization, and we show how rewritten queries can be processed using the CBA proposed for multidimensional databases.
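To make the distinction between the two approaches concrete, the following is a minimal Python sketch over a toy relational plan, not the multidimensional algebra proposed in the paper. It applies one rule-based rewrite (pushing a selection below a join) and then compares the original and rewritten plans with a simple cost model. The plan classes, the table names `sales` and `store`, the row counts, the selectivities, and the cost formula are all illustrative assumptions.

```python
from dataclasses import dataclass
from typing import Union


@dataclass(frozen=True)
class Scan:
    table: str


@dataclass(frozen=True)
class Select:
    table: str          # table whose column the predicate reads (assumed single-table)
    selectivity: float  # assumed fraction of input rows kept
    child: "Plan"


@dataclass(frozen=True)
class Join:
    left: "Plan"
    right: "Plan"


Plan = Union[Scan, Select, Join]

# Hypothetical statistics that a cost-based optimizer would read from a catalog.
ROW_COUNTS = {"sales": 1_000_000, "store": 100}
JOIN_SELECTIVITY = 1 / 100  # assumed: each sales row matches exactly one store


def tables(plan: Plan) -> set:
    """Collect the base tables referenced under a plan node."""
    if isinstance(plan, Scan):
        return {plan.table}
    if isinstance(plan, Select):
        return tables(plan.child)
    return tables(plan.left) | tables(plan.right)


def push_down(plan: Plan) -> Plan:
    """Rule-based step: move a selection below a join onto the side it reads."""
    if isinstance(plan, Select):
        child = push_down(plan.child)
        if isinstance(child, Join):
            if plan.table in tables(child.left):
                moved = Select(plan.table, plan.selectivity, child.left)
                return Join(push_down(moved), child.right)
            if plan.table in tables(child.right):
                moved = Select(plan.table, plan.selectivity, child.right)
                return Join(child.left, push_down(moved))
        return Select(plan.table, plan.selectivity, child)
    if isinstance(plan, Join):
        return Join(push_down(plan.left), push_down(plan.right))
    return plan


def estimate(plan: Plan) -> tuple:
    """Cost-based step: return (output_rows, cost), cost = rows read by all operators."""
    if isinstance(plan, Scan):
        rows = ROW_COUNTS[plan.table]
        return rows, rows
    if isinstance(plan, Select):
        rows, cost = estimate(plan.child)
        return rows * plan.selectivity, cost + rows
    l_rows, l_cost = estimate(plan.left)
    r_rows, r_cost = estimate(plan.right)
    out_rows = l_rows * r_rows * JOIN_SELECTIVITY
    return out_rows, l_cost + r_cost + l_rows + r_rows


# A filter on sales applied after joining sales with store.
original = Select("sales", 0.01, Join(Scan("sales"), Scan("store")))
rewritten = push_down(original)

print("original cost :", estimate(original)[1])
print("rewritten cost:", estimate(rewritten)[1])
```

In this sketch the rewrite needs no statistics at all, while the comparison of the two plans depends entirely on the assumed statistics, which mirrors how the two approaches are typically intermixed.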
Similar resources
Proposing a Novel Cost Sensitive Imbalanced Classification Method based on Hybrid of New Fuzzy Cost Assigning Approaches, Fuzzy Clustering and Evolutionary Algorithms
In this paper, a new hybrid methodology is introduced to design a cost-sensitive fuzzy rule-based classification system. A novel cost metric is proposed based on the combination of three different concepts: Entropy, Gini index and DKM criterion. In order to calculate the effective cost of patterns, a hybrid of fuzzy c-means clustering and particle swarm optimization algorithm is utilized. This ...
Minimizing the MOLAP/ROLAP Divide: You Can Have Your Performance and Scale It Too
Over the past generation, data warehousing and online analytical processing (OLAP) applications have become the cornerstone of contemporary decision support environments. Typically, OLAP servers are implemented on top of either proprietary array-based storage engines (MOLAP) or as extensions to conventional relational DBMSs (ROLAP). While MOLAP systems do indeed provide impressive performance o...
Semiautomatic Image Retrieval Using the High Level Semantic Labels
Content-based image retrieval and text-based image retrieval are two fundamental approaches in the field of image retrieval. The challenges of each approach have led researchers to combine them and to use semi-automatic retrieval, with user interaction in the retrieval cycle. Hence, in this paper, an image retrieval system is introduced that provided two kind of qu...
Algorithms for data warehouse design to enhance decision-making
The performance of on-line analytical processing (OLAP) queries is strongly affected by the amount of data that must be accessed from disk. Therefore, there is a need to employ techniques that facilitate efficient execution of these queries. There has been a lot of work on optimizing the performance of relational data warehouses. Among the two fragmentation techniques, vertical fragmentation is often consider...
Cost Modeling and Estimation for OLAP-XML Federations
The ever-changing data requirements of today’s dynamic businesses are not handled well by current OnLine Analytical Processing (OLAP) systems. Physical integration of unexpected data into OLAP systems is a long and time-consuming process, making logical integration, or federation, the better choice in many cases. The increasing use of XML, e.g. in business-to-business (B2B) applications, sugges...